BIOTECHNOLOGY SYSTEMS BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS:
http://www.uspto.gov/web/offices/pac/checker

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202
Or

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202



Revised 01/29/2002 



Raw Sequence Listing Error Summary 



ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1 Wrapped Nucleics / Wrapped Aminos
The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
prevent "wrapping."



2 Invalid Line Length
The rules require that a line not exceed 72 characters in length. This includes white spaces.

3 Misaligned Amino Numbering
The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
use space characters, instead.

4 Non-ASCII
The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
ensure your subsequent submission is saved in ASCII text.

5 Variable Length
Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
each n or Xaa can only represent a single residue. Please present the maximum number of each
residue having variable length and indicate in the <220>-<223> section that some may be missing.

6 PatentIn 2.0 "bug"
A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
sequence(s). Normally, PatentIn would automatically generate this section from the
previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
Artificial or Unknown sequences.

7 Skipped Sequences (OLD RULES)
Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:
(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
This sequence is intentionally skipped
Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8 Skipped Sequences (NEW RULES)
Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence.
<210> sequence id number
<400> sequence id number
000

9 Use of n's or Xaa's (NEW RULES)
Use of n's and/or Xaa's has been detected in the Sequence Listing.
Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response
Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
is Artificial Sequence.

11 Use of <220>
Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses.
Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
"Unknown." Please explain source of genetic material in <220> to <223> section.
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug"
Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13 Misuse of n
n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
any value not specifically a nucleotide.
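
The file-format conditions named in the summary above (Invalid Line Length, tab codes under Misaligned Amino Numbering, and Non-ASCII) can be screened for before a disk is submitted. The following is a minimal sketch only, not the CHECKER program; the function name `check_listing_text` and the wording of the messages are assumptions made for illustration.

```python
MAX_LINE_LENGTH = 72  # limit stated in the error summary; white space counts


def check_listing_text(raw: bytes):
    """Return (line_number, problem) tuples for a sequence listing file.

    Hypothetical helper: it only screens for three conditions named in the
    error summary (long lines, tab characters, non-ASCII bytes).
    """
    problems = []
    for number, line in enumerate(raw.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line exceeds 72 characters"))
        if b"\t" in line:
            problems.append((number, "tab character found; use spaces"))
        if any(byte > 127 for byte in line):
            problems.append((number, "non-ASCII byte found"))
    return problems
```

Reading the file in binary mode (for example `open(path, "rb").read()`) keeps any non-ASCII bytes visible instead of having them decoded or replaced first.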



AMC/MH - Biotechnology Systems Branch - 08/21/2001
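
The new-rules conditions in the summary (a <220>-<223> section is mandatory when n or Xaa is present, or when the <213> response is Artificial Sequence or Unknown) can be approximated with a short script. This is a sketch under stated assumptions: it assumes one numeric identifier per line, lowercase nucleotide residues, and three-letter amino acid codes; the function name `sequences_missing_feature` is hypothetical, and the organism name used in the usage example is made-up sample data.

```python
import re

# A numeric identifier line such as "<213> Artificial Sequence".
TAG = re.compile(r"^<(\d{3})>\s*(.*)$")


def sequences_missing_feature(listing: str):
    """Return SEQ ID numbers that appear to need a <220> section but lack one."""
    records, current = [], None
    for line in listing.splitlines():
        line = line.strip()
        m = TAG.match(line)
        if m and m.group(1) == "210":
            # <210> starts a new sequence record.
            current = {"id": m.group(2), "organism": "", "has_220": False,
                       "in_seq": False, "residues": ""}
            records.append(current)
        elif current is None:
            continue  # header material before the first <210>
        elif m:
            tag, value = m.groups()
            current["in_seq"] = tag == "400"
            if tag == "213":
                current["organism"] = value
            elif tag == "220":
                current["has_220"] = True
        elif current["in_seq"]:
            current["residues"] += " " + line
    missing = []
    for rec in records:
        needs = rec["organism"].lower() in ("artificial sequence", "unknown")
        for token in rec["residues"].split():
            # "Xaa" residue, or an all-lowercase base group containing "n".
            if token == "Xaa" or (token.isalpha() and token.islower() and "n" in token):
                needs = True
        if needs and not rec["has_220"]:
            missing.append(rec["id"])
    return missing
```

For example, a listing whose SEQ ID 1 has `<213> Artificial Sequence` and no `<220>` line would be reported, while a sequence containing n that also carries `<220>` and `<223>` lines would not.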
RAW SEQUENCE LISTING DATE: 02/15/2002

PATENT APPLICATION: US/10/056,454 TIME: 14:39:23

Input Set : A:\EP.txt

Output Set: N:\CRF3\02152002\J056454.raw

Does Not Comply
Corrected Diskette Needed



SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
TITLE OF INVENTION: Improvements in or Relating to Plant Starch Composition
NUMBER OF SEQUENCES: 20
CORRESPONDENCE ADDRESS:
(B) STREET: 501 Silverside Road, Suite 27
(C) CITY: Wilmington
(D) STATE: Delaware
(E) COUNTRY: United States of America
(F) ZIP: 19809
COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)
CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US/10/056,454
(B) FILING DATE: 24-Jan-2002
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: National Starch and Chemical Investment Holding Corporation


ERRORED SEQUENCES 



27 (2) INFORMATION FOR SEQ ID NO: 1:
29 (i) SEQUENCE CHARACTERISTICS:
30 (A) LENGTH: 57 base pairs
31 (B) TYPE: nucleic acid
32 (C) STRANDEDNESS: single
33 (D) TOPOLOGY: linear
39 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
41 AAGGATJXOT-CGACATCGAT AATACGACTC ACTATAGGGA TTTTTTTTTT
42 TTTTTTT 57



204 (2) INFORMATION FOR SEQ ID NO: 12:
206 (i) SEQUENCE CHARACTERISTICS:
207 (A) LENGTH: 3003 base pairs
208 (B) TYPE: nucleic acid
209 (C) STRANDEDNESS: single
210 (D) TOPOLOGY: linear
216 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:




E--> 218 GATGGGGCCT TGAACTCAGC AATTTGACAC TCAGTTAGTT ACACTGCCAT
219 CACTTATCAG 60
E--> 221 ATCTCTATTT TTTCTCTTAA TTCCAACCAA GGAATGAATA AAAAGATAGA
222 TTTGTAAAAA 120
E--> 224 CCCTAAGGAG AGAAGAAGAA AGATGGTGTA TACACTCTCT GGAGTTCGTT
225 TTCCTACTGT 180
E--> 227 TCCATCAGTG TACAAATCTA ATGGATTCAG CAGTAATGGT GATCGGAGGA
228 ATGCTAATAT 240
E--> 230 TTCTGTATTC TTGAAAAAAC ACTCTCTTTC ACGGAAGATC TTGGCTGAAA
231 AGTCTTCTTA 300
E--> 233 CAATTCCGAA TCCCGACCTT CTACAATTGC AGCATCGGGG AAAGTCCTTG
234 TGCCTGGAAT 360
E--> 236 CCAGAGTGAT AGCTCCTCAT CCTCAACAGA TCAATTTGAG TTCGCTGAGA
237 CATCTCCAGA 420
E--> 239 AAATTCCCCA GCATCAACTG ATGTAGATAG TTCAACAATG GAACACGCTA
240 GCCAGATTAA 480
E--> 242 AACTGAGAAC GATGACGTTG AGCCGTCAAG TGATCTTACA
243 GGAAGTGTTG AAGAGCTGGA 540
E--> 245 TTTTGCTTCA TCACTACAAC TACAAGAAGG TGGTAAACTG GAGGAGTCTA
246 AAACATTAAA 600
E--> 248 TACTTCTGAA GAGACAATTA TTGATGAATC TGATAGGATC AGAGAGGG
249 GCATCCCTCC 660
E--> 251 ACCTGGACTT GGTCAGAAGA TTTATGAAAT AGACCCCCTT TTGACAAACT
252 ATCGTCAACA 720
E--> 254 CCTTGATTAC AGGTATTCAC AGTACAAGAA ACTGAGGGAG GCAATTGACA
255 AGTATGAGGG 780
E--> 257 TGGTTTGGAA GCTTTTTCTC GTGGTTATGA AAGAATGGGT TTCACTCGTA
258 GTGCTACAGG 840
E--> 260 TATCACTTAC CGTGAGTGGG CTCCTGGTGC CCAGTCAGCT
261 GCCCTCATTG GGGATTTCAA 900
E--> 263 CAATTGGGAC GCAAATGCTG ACTTTATGAC TCGGAATGAA TTTGGTGTCT
264 GAGAGATTTT 960
E--> 266 TCTGCCAAAT AATGTGGATG GTTCTCCTGC AATTCCTCAT GGGTCCAGAG
267 TGAAGATACG 1020
E--> 269 TATGGACACT CCATCAGGTG TTAAGGATTC CATTCCTGCT TGGATCAACT
270 ACTCTTTACA 1080
E--> 272 GCTTCCTGAT GAAATTCCAT ATAATGGAAT ATATTATGAT CCACCCGAAG
273 AGGAGAGGTA 1140
E--> 275 TATCTTCCAA CACCCACGGC CAAAGAAACC AAAGTCGGTG AGAATATATG
276 AATCTCATAT 1200
E--> 278 TGGAATGAGT AGTCCGGAGC CTAAAATTAA CTCATACGTG AATTTTAGAG
279 ATGAAGTTCT 1260
E--> 281 TCCTCGCATA AAAAAAGCTT GGGTACAATG CGGTGCAAAT TATGGCTATT
282 CAAGAGCATT 1320
E--> 284 CTTATTATGC TAGTTTTGGT TATCATGTCA CAAATTTTTT TGCACCAAGC
285 AGCCGTTTTG 1380
E--> 287 GAACGCCCGA CGACCTTAAG TCTTTGATTG ATAAAGCTCA TGAGCTAGGA
288 ATTGTTGTTC 1440
E--> 290 TCATGGACAT TGTTCACAGC CATGCATCAA ATAATACTTT AGATGGACTG

291 AACATGTTTG 1500
E--> 293 ACGGCACAGA TAGTTGTTAC TTTCACTCTG GAGCTCGTGG TTATCATTGG
294 ATGTGGGATT 1560
E--> 296 TCCGCCTCTT TAACTATGGA AACTGGGAGG TACTTAGGTA TCTTCTCTCA
297 AATGCGAGAT 1620
E--> 299 GGTGGTTGGA TGAGTTCAAA TTTGATGGAT TTAGATTTGA TGGTGTGACA
300 TCAATGATGT 1680
E--> 302 GTACTCACCA CGGATTATCG GTGGGATTCA CTGGGAACTA
303 CGAGGAATAC TTTGGACTCG 1740
E--> 305 CAACTGATGT GGATGCTGTT GTGTATCTGA TGCTGGTCAA CGATCTTATT
306 CATGGGCTTT 1800
E--> 308 TCCCAGATGC AATTACCATT GGTGAAGATG TTAGCGGAAT GCCGACATTT
309 TGTGTTCCCG 1860
E--> 311 TTCAAGATGG GGGTGTTGGC TTTGACTATC GGCTGCATAT GGCAATTGCT
312 GATAAATGGA 1920
E--> 314 TTGAGTTGCT CAAGAAACGG GATGAGGATT GGAGAGTGGG
315 TGATATTGTT CATACACTGA 1980
E--> 317 CAAATAGAAG ATGGTCGGAA AAGTGTGTTT CATACGCTGA AAGTCATGAT
318 CAAGCTCTAG 2040
E--> 320 TCGGTGATAA AACTATAGCA TTCTGGCTGA TGGACAAGGA TATGTATGAT
321 TTTATGGCTC 2100
E--> 323 TGGATAGACC GTCAACATCA TTAATAGATC GTGGGATAGC ATTACACAAG
324 ATGATTAGGC 2160
E--> 326 TTGTAACTAT GGGATTAGGA GGAGAAGGGT ACCTAAATTT CATGGGAAAT
327 GAATTCGGCC 2220
E--> 329 ACCCTGAGTG GATTGATTTC CCTAGGGCTG AACAACACCT CTCTGATGGC
330 TCAGTAATTC 2280
E--> 332 CCAGAAACCA ATTCAGTTAT GATAAATGCA GACGGAGATT TGACCTGGGA
333 GATGCAGAAT 2340
E--> 335 ATTTAAGATA CCGTGGGTTG CAAGAATTTG ACCGGGCTAT GCAGTATCTT
336 GAAGATAAAT 2400
E--> 338 ATGAGTTTAT GACTTCAGAA CACCAGTTCA TATCACGAAA GGATGAAGGA
339 GATAGGATGA 2460
E--> 341 TTGTATTTGA AAAAGGAAAC CTAGTTTTTG TCTTTAATTT TCACTGGACA
342 AAAGGCTATT 2520
E--> 344 CAGACTATCG CATAGGCTGC CTGAAGCCTG GAAAATACAA
345 GGTTGCCTTG GACTCAGATG 2580
E--> 347 ATCCACTTTT TGGTGGCTTC GGGAGAATTG ATCATAATGC CGAATATTTC
348 ACCTTTGAAG 2640
E--> 350 GATGGTATGA TGATCGTCCT CGTTCAATTA TGGTGTATGC ACCTAGTAGA
351 ACAGCAGTGG 2700
E--> 353 TCTATGCACT AGTAGACAAA GAAGAAGAAG AAGAAGAAGA AGTAGCAGTA
354 GTAGAAGAAG 2760
E--> 356 TAGTAGTAGA AGAAGAATGA ACGAACTTGT GATCGCGTTG AAAGATTTGA
357 ACGCCACATA 2820
E--> 359 GAGCTTCTTG ACGTATCTGG CAATATTGCA TTAGTCTTGG CGGAATTTCA
360 TGTGACAACA 2880
E--> 362 GGTTTGCAAT TCTTTCCACT ATTAGTAGTG CAACGATATA CGCAGAGATG
363 AAGTGCTGAA 2940

E--> 365 CAAAAACATA TGTAAAATCG ATGAATTTAT GTCGAATGCT GGGACGATCG
366 AATTCCTGCA 3000
368 GCC 3003
370 (2) INFORMATION FOR SEQ ID NO: 13:
372 (i) SEQUENCE CHARACTERISTICS:
373 (A) LENGTH: 2975 base pairs
374 (B) TYPE: nucleic acid
375 (C) STRANDEDNESS: single
376 (D) TOPOLOGY: linear
382 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
E--> 384 TTGATGGGCC TTGAACTCAG CAATTTGACA CTCAGTTAGT TACACTCCTA
385 TCACTTATCA 60
E--> 387 GATCTCTATT TTTTCTCTTA ATTCCAACCA GGGGAATGAA TAAAAGGATA
388 GATTTGTAAA 120
E--> 390 AACCCTAAGG AGAGAAGAAG AAAGATGGTG TATATACTCT CTGGAGTTCG
391 TTTTCCTACT 180
E--> 393 GTTCCATCAG TGTACAAATC TAATGGATTC AGCAGTAATG GTGATCGGAG
394 GAATGCTAAT 240
E--> 396 GTTTCTGTAT TCTTGAAAAA GCACTCTCTT TCACGGAAGA TCTTGGCTGA
397 AAAGTCTTCT 300
E--> 399 TACAATTCCG AATTCCGACC TTCTACAGTT GCAGCATCGG GGAAAGTCCT
400 TGTGCCTGGA 360
E--> 402 ACCCAGAGTG ATAGCTCCTC ATCCTCAACA GACCAATTTG AGTTCACTGA
403 GACATCTCCA 420
E--> 405 GAAAATTCCC CAGCATCAAC TGATGTAGAT AGTTCAACAA TGGAACACGC
406 TAGCCAGATT 480
E--> 408 AAAACTGAGA ACGATGACGT TGAGCCGTCA AGTGATCTTA
409 CAGGAAGTGT TGAAGAGCTG 540
E--> 411 GATTTTGCTT CATCACTACA ACTACAAGAA GGTGGTAAAC TGGAGGAGTC
412 TAAAACATTA 600
E--> 414 AATACTTCTG AAGAGACAAT TATTGATGAA TCTGATAGGA TCAGAGAGAG
415 GGGCATCCCT 660
E--> 417 CCACCTGGAC TTGGTCAGAA GATTTATGAA ATAGACCCCC TTTTGACAAA
418 CTATCGTCAA 720
E--> 420 CACCTTGATT ACAGGTATTC ACAGTACAAG AAACTGAGGG AGGCAATTGA
421 CAAGTATGAG 780
E--> 423 GGTGGTTTGG AAGCTTTTCT CGTGGTTATG AAAAAATGGG TTTCACTCGT
424 AGTGCTACAG 840
E--> 426 GTATCACTTA CCGTGAGTGG GCTCCTGGTG CCCAGTCAGC
427 TGCCCTCATT GGAGATTTCA 900
E--> 429 ACAATTGGGA CGCAAATGCT GACATTATGA CTCGGAATGA ATTTGGTGTC
430 TGGGAGATTT 960
E--> 432 TTCTGCCAAA TAATGTGGAT GGTTCTCCTG CAATTCCTCA TGGGTCCAGA
433 GTGAAGATAC 1020
E--> 435 GTATGGACAC TCCATCAGGT GTTAAGGATT CCATTCCTGC TTGGATCAAC
436 TACTCTTTAC 1080
E--> 438 AGCTTCCTGA TGAAATTCCA TATAATGGAA TATATTATGA TCCACCCGAA
439 GAGGAGAGGT 1140
E--> 441 ATATCTTCCA ACACCCACGG CCAAAGAAAC CAAAGTCGCT GAGAATATAT
E--> 441 ATATCTTCCA ACACCCACGG CCAAAGAAAC CAAAGTCGCT GAGAATATAT 







442 GAATCTCATA 1200
E--> 444 TTGGAATGAG TAGTCCGGAG CCTAAAATTA
445 GATGAAGTTC 1260
E--> 447 TTCCTCGCAT AAAAAAGCTT GGGTACAATG
448 CAAGAGCATT 1320
E--> 450 CTTATTATGC TAGTTTTGGT TATCATGTCA
451 AGCCGTTTTG 1380
E--> 453 GAACGCCCGA CGACCTTAAG TCTTCGATTG
454 ATTGTTGTTC 1440
E--> 456 TCATGGACAT CGTTCACAGC CATGCATCAA
457 AACATGTTTG 1500
E--> 459 ACGGCACCGA TAGTTGTTAC TTTCACTCTG
460 ATGTGGGATT 1560
E--> 462 CTGCCTCTTT AACTATGGAA ACTGGGAGGT
463 ATGCGAGATG 1620
E--> 465 GTGGTTGGAT GAGTTCAAAT TTGATGGATT
466 CAATGATGTA 1680
E--> 468 TACTCACCAC GGATTATCGG TGGGATTCAC
469 TTGGACTCGC 1740
E--> 471 AACTGATGTG GATGCTGTTG TGTATCTGAT
472 ATAGGCTTTT 1800
E--> 474 CCCAGATGCA ATTACCATTG GTGAAGATGT
475 GTATTCCCGT 1860
E--> 477 TCAAGATGGG GGTGTTGGCT TTGACTATCG
478 GCAATTGCTG ATAAATGGAT 1920
E--> 480 TGAGTTGCTC AAGAAACGGG ATGAGGATTG
481 GATATTGTTC ATACACTGAC 1980
E--> 483 AAATAGAAGA TGGTCGGAAA AGTGTGTTTC
484 AAGCTCTAGT 2040
E--> 486 CGGTGATAAA ACTATAGCAT TCTGGCTGAT
487 TTATGGCTCT 2100
E--> 489 GGATAGACCG CCAACATCAT TAATAGATCG
490 TGATTAGGCT 2160
E--> 492 TGTAACTATG GGATTAGGAG GAGAAGGGTA
493 AATTCGGCCA 2220
E--> 495 rrrTCAGTGG ATTGATTTCC CTAGGGCTGA
496 TAGTAATTCC 2280
E--> 498 V, vj la nnn \^ \* TTCAGTTATG ATAAATGCAG
499 GACCTGGGAG ATGCAGAATA 2340
E--> 501 TTTAAGATAC CATGGGTTAC AAGAATTTGA
502 AAGATAAATA 2400
E--> 504 TGAGTTTATG ACTTCAGAAC ACCAGTTCAT
505 ATAGGATGAT 2460
E--> 507 TGTATTTGAA AGAGGAAACC TAGTTTTCGT
508 ATAGCTATTC 2520
E--> 510 AGACTATCGC ATAGGCTGCC TGAAGCCTGG
511 GTTGTCTTGG ACTCAGATGA 2580
E--> 513 TCCACTTTTT GGTGGCTTCG GGAGAATTGA
514 CCTCTGAAGG 2640

E--> 516 ATCGTATGAT GATCGTCCTT GTTCAATTAT GGTGTATGCA CCTAGTAGAA 

517 CAGCAGTGGT 2700 
E--> 519 CTATGCACTA GTAGACAAAC TAGAAGTAGC AGTAGTAGAA GAACCCATTG

520 AAGAATGAAC 2760

E--> 522 GAACTTGTGA TCGCGTTGAA AGATTTGAAC GTTACTTGGT CATCCACATA 

523 GAGCTTCTTG 2820 
E--> 525 ACATCAGTCT TGGCGGAATT GCATGTGACA ACAAGGTTTG CAGTTCTTTC

526 CACTATTAGT 2880 
E--> 528 AGTCCACCGA TATACGCAGA GATGAAGTGC TGAACAAACA TATGTAAAAT 

529 CGATGAATTT 2940 
E--> 531 ATGTCGAATG CTGGGACGAT CGAATTCCTG CAGCC 
W--> 532 2975 

534 (2) INFORMATION FOR SEQ ID NO: 14: 

536 (i) SEQUENCE CHARACTERISTICS: 

537 (A) LENGTH: 3033 base pairs 

538 (B) TYPE: nucleic acid 

539 (C) STRANDEDNESS: single

540 (D) TOPOLOGY: linear
544 (ix) FEATURE:

545 (A) NAME/KEY: CDS

546 (B) LOCATION: 145..2790

549 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

E--> 551 TTGATGGGGC CTTGAACTCA GCAATTTGAC ACTCAGTTAG TTACACTCCT 

552 ATCACTTATC 60 
E--> 554 AGATCTCTAT TTTTTCTCTT AATTCCAACC AAGGAATGAA TAAAAGGATA 

555 GATTTGTAAA 120

E--> 557 AACCCTAAGG AGAGAAGAAG AAAG ATG GTG TAT ACA CTC TCT GGA 

558 GTT CGT 171 

559 Met Val Tyr Thr Leu Ser Gly Val Arg 

560 1 5 
E--> 562 TTT CCT ACT GTT CCA TCA GTG TAC AAA TCT AAT GGA TTC AGC AGT 

563 AAT 219 

564 Phe Pro Thr Val Pro Ser Val Tyr Lys Ser Asn Gly Phe Ser Ser Asn 

565 10 15 20 25 
E--> 567 GGT GAT CGG AGG AAT GCT AAT GTT TCT GTA TTC TTG AAA AAG CAC 

568 TCT 267 

569 Gly Asp Arg Arg Asn Ala Asn Val Ser Val Phe Leu Lys Lys His Ser 

570 30 35 40 
E--> 572 CTT TCA CGG AAG ATC TTG GCT GAA AAG TCT TCT TAC AAT TCC GAA 

573 TTC 315 

574 Leu Ser Arg Lys Ile Leu Ala Glu Lys Ser Ser Tyr Asn Ser Glu Phe

575 45 50 55 
E--> 577 CGA CCT TCT ACA GTT GCA GCA TCG GGG AAA GTC CTT GTG CCT GGA 

578 ACC 363 

579 Arg Pro Ser Thr Val Ala Ala Ser Gly Lys Val Leu Val Pro Gly Thr 

580 60 65 70 
E--> 582 CAG AGT GAT AGC TCC TCA TCC TCA ACA GAC CAA TTT GAG TTC ACT

583 GAG 411

584 Gln Ser Asp Ser Ser Ser Ser Ser Thr Asp Gln Phe Glu Phe Thr Glu
585 75 80 85
E--> 587 ACA TCT CCA GAA AAT TCC CCA GCA TCA ACT GAT GTA GAT AGT TCA
588 ACA 459
589 Thr Ser Pro Glu Asn Ser Pro Ala Ser Thr Asp Val Asp Ser Ser Thr
590 90 95 100 105
E--> 592 ATG GAA CAC GCT AGC CAG ATT AAA ACT GAG AAC GAT GAC GTT GAG
593 CCG 507
594 Met Glu His Ala Ser Gln Ile Lys Thr Glu Asn Asp Asp Val Glu Pro
595 110 115 120
E--> 597 TCA AGT GAT CTT ACA GGA AGT GTT GAA GAG CTG GAT TTT GCT TCA
598 TCA 555
599 Ser Ser Asp Leu Thr Gly Ser Val Glu Glu Leu Asp Phe Ala Ser Ser
600 125 130 135
E--> 602 CTA CAA CTA CAA GAA GGT GGT AAA CTG GAG GAG TCT AAA ACA TTA
603 AAT 603
604 Leu Gln Leu Gln Glu Gly Gly Lys Leu Glu Glu Ser Lys Thr Leu Asn
605 140 145 150
E--> 607 ACT TCT GAA GAG ACA ATT ATT GAT GAA TCT GAT AGG ATC AGA GAG
608 AGG 651
609 Thr Ser Glu Glu Thr Ile Ile Asp Glu Ser Asp Arg Ile Arg Glu Arg
610 155 160 165
E--> 612 GGC ATC CCT CCA CCT GGA CTT GGT CAG AAG ATT TAT GAA ATA GAC
613 CCC 699
614 Gly Ile Pro Pro Pro Gly Leu Gly Gln Lys Ile Tyr Glu Ile Asp Pro
615 170 175 180 185
E--> 617 CTT TTG ACA AAC TAT CGT CAA CAC CTT GAT TAC AGG TAT TCA CAG
618 TAC 747
619 Leu Leu Thr Asn Tyr Arg Gln His Leu Asp Tyr Arg Tyr Ser Gln Tyr
620 190 195 200
E--> 622 AAG AAA CTG AGG GAG GCA ATT GAC AAG TAT GAG GGT GGT TTG GAA
623 GCC 795
624 Lys Lys Leu Arg Glu Ala Ile Asp Lys Tyr Glu Gly Gly Leu Glu Ala
625 205 210 215
E--> 627 TTT TCT CGT GGT TAT GAA AAA ATG GGT TTC ACT CGT AGT GCT ACA
628 GGT 843
629 Phe Ser Arg Gly Tyr Glu Lys Met Gly Phe Thr Arg Ser Ala Thr Gly
630 220 225 230
E--> 632 ATC ACT TAC CGT GAG TGG GCT CTT GGT GCC CAG TCA GCT GCC CTC
633 ATT 891
634 Ile Thr Tyr Arg Glu Trp Ala Leu Gly Ala Gln Ser Ala Ala Leu Ile
635 235 240 245
E--> 637 GGA GAT TTC AAC AAT TGG GAC GCA AAT GCT GAC ATT ATG ACT CGG
638 AAT 939
639 Gly Asp Phe Asn Asn Trp Asp Ala Asn Ala Asp Ile Met Thr Arg Asn
640 250 255 260 265
E--> 642 GAA TTT GGT GTC TGG GAG ATT TTT CTG CCA AAT AAT GTG GAT GGT
643 TCT 987
644 Glu Phe Gly Val Trp Glu Ile Phe Leu Pro Asn Asn Val Asp Gly Ser
645 270 275 280
E--> 647 CCT GCA ATT CCT CAT GGG TCC AGA GTG AAG ATA CGT ATG GAC ACT
648 CCA 1035
649 Pro Ala Ile Pro His Gly Ser Arg Val Lys Ile Arg Met Asp Thr Pro
650 285 290 295
E--> 652 TCA GGT GTT AAG GAT TCC ATT CCT GCT TGG ATC AAC TAC TCT TTA
653 CAG 1083
654 Ser Gly Val Lys Asp Ser Ile Pro Ala Trp Ile Asn Tyr Ser Leu Gln
655 300 305 310
E--> 657 CTT CCT GAT GAA ATT CCA TAT AAT GGA ATA CAT TAT GAT CCA CCC
658 GAA 1131
659 Leu Pro Asp Glu Ile Pro Tyr Asn Gly Ile His Tyr Asp Pro Pro Glu
660 315 320 325
E--> 662 GAG GAG AGG TAT ATC TTC CAA CAC CCA CGG CCA AAG AAA CCA AAG
663 TCG 1179
664 Glu Glu Arg Tyr Ile Phe Gln His Pro Arg Pro Lys Lys Pro Lys Ser
665 330 335 340 345
E--> 667 CTG AGA ATA TAT GAA TCT CAT ATT GGA ATG AGT AGT CCG GAG CCT
668 AAA 1227
669 Leu Arg Ile Tyr Glu Ser His Ile Gly Met Ser Ser Pro Glu Pro Lys
670 350 355 360
E--> 672 ATT AAC TCA TAC GTG AAT TTT AGA GAT GAA GTT CTT CCT CGC ATA
673 AAA 1275
674 Ile Asn Ser Tyr Val Asn Phe Arg Asp Glu Val Leu Pro Arg Ile Lys
675 365 370 375
E--> 677 AAG CTT GGG TAC AAT GCG CTG CAA ATT ATG GCT ATT CAA GAG CAT
678 TCT 1323
679 Lys Leu Gly Tyr Asn Ala Leu Gln Ile Met Ala Ile Gln Glu His Ser
680 380 385 390
E--> 682 TAT TAC GCT AGT TTT GGT TAT CAT GTC ACA AAT TTT TTT GCA CCA
683 AGC 1371
684 Tyr Tyr Ala Ser Phe Gly Tyr His Val Thr Asn Phe Phe Ala Pro Ser
685 395 400 405
E--> 687 AGC CGT TTT GGA ACG CCC GAC GAC CTT AAG TCT TTG ATT GAT AAA
688 GCT 1419
689 Ser Arg Phe Gly Thr Pro Asp Asp Leu Lys Ser Leu Ile Asp Lys Ala
690 410 415 420 425
E--> 692 CAT GAG CTA GGA ATT GTT GTT CTC ATG GAC ATT GTT CAC AGC CAT
693 GCA 1467
694 His Glu Leu Gly Ile Val Val Leu Met Asp Ile Val His Ser His Ala
695 430 435 440
E--> 697 TCA AAT AAT ACT TTA GAT GGA CTG AAC ATG TTT GAC TGC ACC GAT
698 AGT 1515
699 Ser Asn Asn Thr Leu Asp Gly Leu Asn Met Phe Asp Cys Thr Asp Ser
700 445 450 455
E--> 702 TGT TAC TTT CAC TCT GGA GCT CGT GGT TAT CAT TGG ATG TGG GAT
703 TCC 1563
704 Cys Tyr Phe His Ser Gly Ala Arg Gly Tyr His Trp Met Trp Asp Ser
705 460 465 470
E--> 707 CGC CTC TTT AAC TAT GGA AAC TGG GAG GTA CTT AGG TAT CTT CTC
708 TCA 1611
709 Arg Leu Phe Asn Tyr Gly Asn Trp Glu Val Leu Arg Tyr Leu Leu Ser
710 475 480 485
E--> 712 AAT GCG AGA TGG TGG TTG GAT GCG TTC AAA TTT GAT GGA TTT AGA
713 TTT 1659
714 Asn Ala Arg Trp Trp Leu Asp Ala Phe Lys Phe Asp Gly Phe Arg Phe
715 490 495 500 505
E--> 717 GAT GGT GTG ACA TCA ATG ATG TAT ATT CAC CAC GGA TTA TCG GTG
718 GGA 1707
719 Asp Gly Val Thr Ser Met Met Tyr Ile His His Gly Leu Ser Val Gly
720 510 515 520
E--> 722 TTC ACT CCC AAC TAC GAG GAA TAC TTT GGA CTC GCA ACT GAT GTG
723 GAT 1755
724 Phe Thr Glu Glu Tyr Phe Gly Leu Ala Thr Asp Val Asp
725 525 530 535
E--> 727 GCT GTT GTG TAT CTG ATG CTG GTC AAC GAT CTT ATT CAT GGG CTT
728 TTC 1803
729 Ala Val Val Tyr Leu Met Leu Val Asn Asp Leu Ile His Gly Leu Phe
730 540 545 550
E--> 732 CCA GAT GCA ATT ACC ATT GGT GAA GAT GTT AGC GGA ATG CCG ACA
733 TTT 1851
734 Pro Asp Ala Ile Thr Ile Gly Glu Asp Val Ser Gly Met Pro Thr Phe
735 555 560 565
E--> 737 TGT ATT CCC GTC CAA GAG GGG GGT GTT GGC TTT GAC TAT CGG CTG
738 CAT 1899
739 Cys Ile Pro Val Gln Glu Gly Gly Val Gly Phe Asp Tyr Arg Leu His
740 570 575 580 585
E--> 742 ATG GCA ATT GCT GAT AAA CGG ATT GAG TTG CTC AAG AAA CGG GAT
743 GAG 1947
744 Met Ala Ile Ala Asp Lys Arg Ile Glu Leu Leu Lys Lys Arg Asp Glu
745 590 595 600
E--> 747 GAT TGG AGA GTG GGT GAT ATT GTT CAT ACA CTG ACA AAT AGA AGA
748 TGG 1995
749 Asp Trp Arg Val Gly Asp Ile Val His Thr Leu Thr Asn Arg Arg Trp
750 605 610 615
E--> 752 TCG GAA AAG TGT GTT TCA TAC GCT GAA AGT CAT GAT CAA GCT CTA
753 GTC 2043
754 Ser Glu Lys Cys Val Ser Tyr Ala Glu Ser His Asp Gln Ala Leu Val
755 620 625 630
E--> 757 GGT GAT AAA ACT ATA GCA TTC TGG CTG ATG GAC AAG GAT ATG TAT
758 GAT 2091
759 Gly Asp Lys Thr Ile Ala Phe Trp Leu Met Asp Lys Asp Met Tyr Asp
760 635 640 645
E--> 762 TTT ATG GCT CTG GAT AGA CCG TCA ACA TCA TTA ATA GAT CGT GGG
763 ATA 2139
764 Phe Met Ala Leu Asp Arg Pro Ser Thr Ser Leu Ile Asp Arg Gly Ile
765 650 655 660 665
E--> 767 GCA TTG CAC AAG ATG ATT AGG CTT GTA ACT ATG GGA TTA GGA GGA
768 GAA 2187
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769 


Ala 


Leu 


His Lys 


Met 


He 


Arg 


Leu 


Val 


Thr 


Met 


Gly 


Leu 


Gly 


Gly 


Glu 






770 








670 










675 










680 




E--> 772 GGG TAC CTA AAT TTC ATG GGA AAT GAA TTC GGC CAC CCT GAG TGG
773 ATT 2235

774 Gly Tyr Leu Asn Phe Met Gly Asn Glu Phe Gly His Pro Glu Trp Ile
775 685 690 695

E--> 777 GAT TTC CCT AGG GCT GAA CAA CAC CTC TCT GAT GGC TCA GTA ATC
778 CCC 2283

779 Asp Phe Pro Arg Ala Glu Gln His Leu Ser Asp Gly Ser Val Ile Pro
780 700 705 710

E--> 782 GGA AAC CAA TTC AGT TAT GAT AAA TGC AGA CGG AGA TTT GAC CTG
783 GGA 2331

784 Gly Asn Gln Phe Ser Tyr Asp Lys Cys Arg Arg Arg Phe Asp Leu Gly
785 715 720 725

E--> 787 GAT GCA GAA TAT TTA AGA TAC CGT GGG TTG CAA GAA TTT GAC CGG
788 CCT 2379

789 Asp Ala Glu Tyr Leu Arg Tyr Arg Gly Leu Gln Glu Phe Asp Arg Pro
790 730 735 740 745

E--> 792 ATG CAG TAT CTT GAA GAT AAA TAT GAG TTT ATG ACT TCA GAA CAC
793 CAG 2427

794 Met Gln Tyr Leu Glu Asp Lys Tyr Glu Phe Met Thr Ser Glu His Gln
795 750 755 760

E--> 797 TTC ATA TCA CGA AAG GAT GAA GGA GAT AGG ATG ATT GTA TTT GAA
798 AAA 2475

799 Phe Ile Ser Arg Lys Asp Glu Gly Asp Arg Met Ile Val Phe Glu Lys
800 765 770 775

E--> 802 GGA AAC CTA GTT TTT GTC TTT AAT TTT CAC TGG ACA AAA AGC TAT
803 TCA 2523

804 Gly Asn Leu Val Phe Val Phe Asn Phe His Trp Thr Lys Ser Tyr Ser
805 780 785 790

E--> 807 GAC TAT CGC ATA GCC TGC CTG AAG CCT GGA AAA TAC AAG GTT GCC
808 TTG 2571

809 Asp Tyr Arg Ile Ala Cys Leu Lys Pro Gly Lys Tyr Lys Val Ala Leu
810 795 800 805

E--> 812 GAC TCA GAT GAT CCA CTT TTT GGT GGC TTC GGG AGA ATT GAT CAT
813 AAT 2619

814 Asp Ser Asp Asp Pro Leu Phe Gly Gly Phe Gly Arg Ile Asp His Asn
815 810 815 820 825

E--> 817 GCC GAA TAT TTC ACC TTT GAA GGA TGG TAT GAT GAT CGT CCT CGT
818 TCA 2667

819 Ala Glu Tyr Phe Thr Phe Glu Gly Trp Tyr Asp Asp Arg Pro Arg Ser
820 830 835 840

E--> 822 ATT ATG GTG TAT GCA CCT TGT AAA ACA GCA GTG GTC TAT GCA CTA
823 GTA 2715

824 Ile Met Val Tyr Ala Pro Cys Lys Thr Ala Val Val Tyr Ala Leu Val
825 845 850 855

E--> 827 GAC AAA GAA GAA GAA GAA GAA GAA GAA GAA GAA GAA GAA GTA GCA
828 GCA 2763

829 Asp Lys Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Val Ala Ala
830 860 865 870

E--> 832 GTA GAA GAA GTA GTA GTA GAA GAA GAA TGAACGAACT TGTGATCGCG
W--> 833 2810 

834 Val Glu Glu Val Val Val Glu Glu Glu 
W--> 835 875 880 

E--> 837 TTGAAAGATT TGAACGCTAC ATAGAGCTTC TTGACGTATC TGGCAATATT 

838 GCATCAGTCT 2870 
E--> 840 TGGCGGAATT TCATGTGACA CAAGGTTTGC AATTCTTTCC ACTATTAGTA 

841 GTGCAACGAT 2930 
E--> 843 ATACGCAGAG ATGAAGTGCT GAACAAACAT ATGTAAAATC GATGAATTTA

844 TGTCGAATGC 2990
E--> 846 TGGGACGATC GAATTCCTGC AGGCCGGGGG ACCCCTTAGT TCT
W--> 847 3033 

1028 (2) INFORMATION FOR SEQ ID NO: 16: 

1030 (i) SEQUENCE CHARACTERISTICS: 

1031 (A) LENGTH: 2576 base pairs 

1032 (B) TYPE: nucleic acid

1033 (C) STRANDEDNESS: single

1034 (D) TOPOLOGY: linear 

1040 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

E--> 1042 TCATTAAAGA GGAGAAATTA ACTATGAGAG GATCTCACCA TCACCATCAC 

1043 CATGGGATCT 60
E--> 1045 TGGCTGAAAA GTCTTCTTAC AATTCCGAAT TCCGACCTTC TACAGTTGCA

1046 GCATCGGGGA 120
E--> 1048 AAGTCCTTGT GCCTGGAACC CAGAGTGATA GCTCCTCATC CTCAACAAAC

1049 CAATTTGAGT 180 
E--> 1051 TCACTGAGAC ATCTCCAGAA AATTCCCCAG CATCAACTGA TGTAGATAGT 

1052 TCAACAATGG 240 
E--> 1054 AACACGCTAG CCAGATTAAA ACTGAGAACG ATGACGTTGA

1055 GCCGTCAAGT GATCTTACAG 300
E--> 1057 GAAGTGTTGA AGAGCTGGAT TTTGCTTCAT CACTACAACT ACAAGAAGGT

1058 GGTAAACTGG 360

E--> 1060 AGGAGTCTAA AACATTAAAT ACTTCTGAAG AGACAATTAT TGATGAATCT 

1061 GATAGGATCA 420 
E--> 1063 GAGAGAGGGG CATCCCTCCA CCTGGACTTG GTCAGAAGAT 

1064 TTATGAAATA GACCCCCTTT 480 
E--> 1066 TGACAAACTA TCGTCAACAC CTTGATTACA GGTATTCACA GTACAAGAAA 

1067 CTGAGGGAGG 540

E--> 1069 CAATTGACAA GTATGAGGGT GGTTTGGAAG CTTTTTCTCG TGGTTATGAA 

1070 AAAATGGGTT 600
E--> 1072 TCACTCGTAG TGCTACAGGT ATCACTTACC GTGAGTGGGC

1073 TCCTGGTGCC CAGTCAGCTG 660

E--> 1075 CCCTCATTGG AGATTTCAAC AATTGGGACG CAAATGCTGA CATTATGACT

1076 CGGAATGAAT 720

E--> 1078 TTGGTGTCTG GGAGATTTTT CTGCCAAATA ATGTGGATGG TTCTCCTGCA 

1079 ATTCCTCATG 780 
E--> 1081 GGTCCAGAGT GAAGATACGT ATGGACACTC CATCAGGTGT 

1082 TAAGGATTCC ATTCCTGCTT 840 
E--> 1084 GGATCAACTA CTCTACAGCT TCCTGATGAA ATTCCATATA ATGGAATATA

1085 TTATGATCCA 900
E--> 1087 [illegible] TTTCCAACAC CCACGGCCAA

1088 [illegible] 960
E--> 1090 [illegible] CCGGAGCCTA AAATTAACTC

1091 ATACGTGAAT 1020
E--> 1093 [illegible] TCGCATAAAA AAGCTTGGGT ACAATGCGCT

1094 GCAAATTATG 1080
E--> 1096 [illegible] TTATGCTAGT TTTGGTTATC ATGTCACAAA

1097 [illegible] 1140
E--> 1099 [illegible] CTTAAGTCTT TGATTGATAA

1100 [illegible] 1200
E--> 1102 [illegible] CACAGCCATG CATCAAATAA

1103 [illegible] 1260
E--> 1105 [illegible] ACTCTGGAGC

1106 [illegible] 1320
E--> 1108 [illegible] TATGGAAACT GGGAGGTACT

1109 [illegible] 1380
E--> 1111 [illegible] ATGGATTTAG

1112 [illegible] 1440
E--> 1114 [illegible]

1115 [illegible] 1500
E--> 1117 [illegible] ATCTGATGCT

1118 [illegible] 1560
E--> 1120 [illegible] AAGATGTTAG

1121 [illegible] 1620
E--> 1123 [illegible] ACTATCGGCT

1124 [illegible] 1680
E--> 1126 [illegible]

1127 [illegible] 1740
E--> 1129 [illegible] GTGTTTCATA

1130 [illegible] 1800
E--> 1132 [illegible] GGCTGATGGA

1133 [illegible] 1860
E--> 1135 [illegible] TAGATCGTGG

1136 [illegible] 1920
E--> 1138 [illegible] TTAGGAGGAG

1139 [illegible] 1980
E--> 1141 [illegible]

1142 GGGCTGAACA ACACCTCTCT 2040
E--> 1144 GATGACTCAG TAATTCCCGG AAACCAATTC AGTTATGATA AATGCAGACG

1145 GAGATTTGAC 2100
E--> 1147 CTGGGAGATG CAGAATATTT AAGATACCGT GGGTTGCAAG AATTTGACCG

1148 GGCTATGCAG 2160
E--> 1150 TATCTTGAAG ATAAATATGA GTTTATGACT TCAGAACACC AGTTCATATC

1151 ACGAAAGGAT 2220
E--> 1153 GAAGGAGATA GGATGATTGT ATTTGAAAAA GGAAACCTAG TTTTTGTCTT

1154 TAATTTTCAC 2280
E--> 1156 TGGACAAAAA GCTATTCAGA CTATCGCATA GGCTGCCTGA AGCCTGGAAA

1157 ATACAAGGTT 2340
E--> 1159 GCCTTGGACT CAGATGATCC ACTTTTTGGT GGCTTCGGGA GAATTGATCA

1160 TAATGCCGAA 2400 
E--> 1162 TATTTCACCT TTGAAGGATG GTATGATGAT CGTCCTCGTT CAATTATGGT 

1163 GTATGCACCT 2460 
E--> 1165 TGTAGAACAG CAGTGGTCTA TGCACTAGTA GACAAAGAAG 

1166 AAGAAGAAGA AGAAGAAGAA 2520
E--> 1168 GAAGAAGTAG CAGTAGTAGA AGAAGTAGTA GTAGAAGAAG

1169 AATGAACGAA CTTGTG 2576

1171 (2) INFORMATION FOR SEQ ID NO: 17: 

1173 (i) SEQUENCE CHARACTERISTICS: 

1174 (A) LENGTH: 2529 base pairs 

1175 (B) TYPE: nucleic acid 

1176 (C) STRANDEDNESS: single

1177 (D) TOPOLOGY: linear 

1183 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

E--> 1185 GGATGCTAAT GTTTCTGTAT TCTTGAAAAA GCACTCTCTT TCACGGAAGA 

1186 TCTTGGCTGA 60 
E--> 1188 AAAGTCTTCT TACAATTCCG AATCCCGACC TTCTACAGTT GCAGCATCGG 

1189 GGAAAGTCCT 120 
E--> 1191 TGTGCCTGGA AYCCAGAGTG ATAGCTCCTC ATCCTCAACA GACCAATTTG 

1192 AGTTCACTGA 180 
E--> 1194 GACATCTCCA GAAAATTCCC CAGCATCAAC TGATGTAGAT AGTTCAACAA 

1195 TGGAACACGC 240 
E--> 1197 TAGCCAGATT AAAACTGAGA ACGATGACGT TGAGCCGTCA AGTGATCTTA 

1198 CAGGAAGTGT 300 
E--> 1200 TGAAGAGCTG GATTTTGCTT CATCACTACA ACTACAAGAA GGTGGTAAAC 

1201 TGGAGGAGTC 360 
E--> 1203 TAAAACATTA AATACTTCTG AAGAGACAAT TATTGATGAA TCTGATAGGA 

1204 TCAGAGAGAG 420

E--> 1206 GGGCATCCCT CCACCTGGAC TTGGTCAGAA GATTTATGAA

1207 ATAGACCCCC TTTTGACAAA 480

E--> 1209 CTATCGTCAA CACCTTGATT ACAGGTATTC ACAGTACAAG AAACTGAGGG 

1210 AGGCAATTGA 540 
E--> 1212 CAAGTATGAG GGTGGTTTGG AAGCTTTTTC TCGTGGTTAT GAAAAAATGG 

1213 GTTTCACTCG 600 
E--> 1215 TAGTGCTACA GGTATCACTT ACCGTGAGTG GGCTCCTGGT 

1216 GCCCAGTCAG CTGCCCTCAT 660 
E--> 1218 TGGAGATTTC AACAATTGGG ACGCAAATGC TGACATTATG ACTCGGAATG 

1219 AATTTGGTGT 720 
E--> 1221 CTGGGAGATT TTTCTGCCAA ATAATGTGGA TGGTTCTCCT GCAATTCCTC 

1222 ATGGGTCCAG 780 
E--> 1224 AGTGAAGATA CGYATGGACA CTCCATCAGG TGTTAAGGAT TCCATTCCTG 

1225 CTTGGATCAA 840
E--> 1227 CTACTCTTTA CAGCTTCCTG ATGAAATTCC ATATAATGGA ATATATTATG

1228 ATCCACCCGA 900
E--> 1230 AGAGGAGAGG TATRTCTTCC AACACCCACG GCCAAAGAAA

1231 CCAAAGTCGC TGAGAATATA 960
E--> 1233 TGAATCTCAT ATTGGAATGA GTAGTCCGGA GCCTAAAATT AACTCATACG 

1234 TGAATTTTAG 1020
E--> 1236 [illegible] CTTCCTCGCA [illegible]

1237 TTATGGCTAT 1080
E--> 1239 [illegible]

1240 [illegible] 1140
E--> 1242 [illegible] GGAACGCCCG [illegible]

1243 ATGAGCTAGG 1200
E--> 1245 [illegible] CTCATGGACA [illegible]

1246 TAGATGGACT 1260
E--> 1248 [illegible] GACGGCACAG [illegible]

1249 [illegible] 1320
E--> 1251 [illegible]

1252 [illegible] 1380
E--> 1254 [illegible]

1255 [illegible] 1440
E--> 1257 [illegible]

1258 [illegible] 1500
E--> 1260 [illegible] TGGATGCTGT TGTGTATCTG ATGCTGGTCA

1261 [illegible] 1560
E--> 1263 [illegible] CAATTACCAT TGGTGAAGAT GTTAGCGGAA

1264 [illegible] 1620
E--> 1266 [illegible] GGGGTGTTGG CTTTGACTAT CGGCTGCATA

1267 [illegible] 1680
E--> 1269 [illegible] TCAAGAAACG GGATGAGGAT

1270 TGGAGAGTGG GTGATATTGT 1740
E--> 1272 [illegible] GATGGTCGGA AAAGTGTGTT TCATMCGCTG

1273 [illegible] 1800
E--> 1275 [illegible] AAACTATAGC ATYCTGGCTG ATGGACAAGG

1276 [illegible] 1860
E--> 1278 [illegible]

1279 [illegible] 1920
E--> 1281 [illegible]

1282 [illegible] 1980
E--> 1284 [illegible]

1285 [illegible] 2040
E--> 1287 [illegible]

1288 [illegible] 2100
E--> 1290 [illegible]

1291 [illegible] 2160
E--> 1293 TGAAGATAAA TATGAGTTTA [illegible]

1294 AGGATGAAGG 2220
E--> 1296 AGATAGGATG ATTGTATTTG [illegible]

1297 TTCACTGGAC 2280
E--> 1299 AAATAGCTAT TCAGACTATC [illegible]

1300 AGGTTGGCTT 2340
E--> 1302 GGACTCAGAT GATCCACTTT [illegible]

1303 CCGAATATTT 2400
E--> 1305 CACCTCTGAA GGATCGTATG [illegible]

1306 CACCTAGTAG 2460
E--> 1308 AACAGCAGTG GTCTATGCAC [illegible]

Output Set: N:\CRF3\02152002\J056454.raw

1309 GAAGAAGAAG AAGAANCCGN 2520

1311 NGAAGAATT 2529

1313 (2) INFORMATION FOR SEQ ID NO: 18: 

1315 (i) SEQUENCE CHARACTERISTICS: 

1316 (A) LENGTH: 3231 base pairs 

1317 (B) TYPE: nucleic acid 

1318 (C) STRANDEDNESS: single

1319 (D) TOPOLOGY: linear 

1325 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

E--> 1327 GATTTAATAC GACTCACTAT AGGGATTTTT TTTTTTTTTT TTTTAAAAAC 

1328 CTCCTCCACT 60 
E--> 1330 CAGTCTTGGG ATCTCTCTCT CTCTTCACGC TTCTCTTGGG GCCTTGAACT 

1331 CAGCAATTTG 120
E--> 1333 ACACTCAGTT AGTTACACTC CTATCACTCA TCAGATCTCT ATTTTTTCTC 

1334 TTAATTCCAA 180 
E--> 1336 CCAAGGAATG AATTAAAAGA TTAGATTTGA AGGAGAGAAG AAGAAAGATG 

1337 GTGTATACAC 240 
E--> 1339 TCTCTGGAGT TCGTTTTCCT ACTGTTCCAT CAGTGTACAA ATCTAATGGA 

1340 TTCAGCAGTA 300 
E--> 1342 ATGGTGATCG GAGGAATGCT AATGTTTCTG TATTCTTGAA AAAGCACTCT 

1343 CTTTCACGGA 360 
E--> 1345 AGATCTTGGC TGAAAAGTCT TCTTACGATT CCGAATCCCG ACCTTCTACA 

1346 GTTGCAGCAT 420
E--> 1348 CGGGGAAAGT CCTTGTACCT GGAATCCAGA GTGATAGCTC

1349 CTCATCCTCA ACAGACCAAT 480 
E--> 1351 TTGAGTTCAC TGAGACAGCT CCAGAAAATT CCCCAGCATC AACTGATGTG 

1352 GATAGTTCAA 540 
E--> 1354 CAATGGAACA CGCTAGCCAG ATTAAAACTG AGAACGATGA 

1355 CGTTGAGCCG TCAAGTGATC 600 
E--> 1357 TTACAGGAAG TGTTGAAGAG TTGGATTTTG CTTCATCACT ACAACTACAA 

1358 GAAGGTGGTA 660
E--> 1360 AACTGGAGGA GTCTAAAACA TTAAATACTT CTGAAGAGAC AATTATTGAT 

1361 GAATCTGATA 720 
E--> 1363 GGATCAGAGA GAGGGGCATC CCTCCACCTG GACTTGGTCA 

1364 GAAGATTTAT GAAATAGACC 780

E--> 1366 CCCTTTTGAC AAACTATCGT CAACACCTTG ATTACAGGTA TTCACAGTAC

1367 AAGAAAATGA 840

E--> 1369 GGGAGGCAAT TGACAAGTAT GAGGGTGGTT TGGAAGCTTT

1370 TTCTCGTGGT TATGAAAAAA 900
E--> 1372 TGGGTTTCAC TCGTAGTGCT ACAGGTATCA CTTACCGTGA GTGGGCTCCT 

1373 GGTGCCCAGT 960 
E--> 1375 CAGCTGCTCT CATTGGAGAT TTCAACAATT GGGACGCAAA TGCTGACATT 

1376 ATGACTCGGA 1020 
E--> 1378 ATGAATTTGG TGTCTGGGAG ATTTTTCTGC CAAATAATGT GGATGGTTCT

1379 CCTGCAATTC 1080 
E--> 1381 CTCATGGGTC CAGAGTGAAG ATACGCATGG ACACTTCATC 

1382 AGGTGTTAAG GATTCCATTC 1140 
E--> 1384 CTGCTTGGAT CAACTACTCT TTACAGCTTC CTGATGAAAT TCCATATAAT

1385 GGAATATATT 1200
E--> 1387 ATGATCCACC CGAAGAGGAG AGGTATGTCT TCCAACACCC

1388 ACGGCCAAAG AAACCAAAGT 1260 

E--> 1390 CGCTGAGAAT ATATGAATCT CATATTGGAA TGAGTAGTCC GGAGCCTAAA 

1391 ATTAACTCAT 1320 

E--> 1393 ACGTGAATTT TAGAGATGAA GTTCTTCCTC GCATAAAAAA CCTTGGGTAC 

1394 AATGCGGTGC 1380 

E--> 1396 AAATTATGGC TATTCAAGAG CATTCTTATT ATGCTAGTTT TGGTTATCAT 

1397 GTCACAAATT 1440 

E--> 1399 TTTTTGCACC AAGCAGCCGT TTTGGAACGC CCGACGACCT TAAGTCTTTG 

1400 ATTGATAAAG 1500 

E--> 1402 CTCATGAGCT AGGAATTGTT GTTCTCATGG ACATTGTTCA CAGCCATGCA 

1403 TCAAATAATA 1560

E--> 1405 CTTTAGATGG ACTGAACATG TTTGACGGCA CAGATAGTTG TTACTTTCAC

1406 TCTGGAGCTC 1620

E--> 1408 GTGGTTATCA TTGGATGTGG GATTCCCGCC TCTTTAACTA TGGAAACTGG

1409 GAGGTACTTA 1680 

E--> 1411 GGTATCTTCT CTCAAATGCG AGATGGTGGT TGGATGAGTG CAAATTTGRT 

1412 GGATTTAGAT 1740 

E--> 1414 TTGATGGTGT GACATCAATG ATGTATACTC ACCACGGATT ATCGGTGGGA 

1415 TTCACTGGGA 1800 

E--> 1417 ACTACGAGGA ATACTTTGGA CTCGCAACTG ATGTRGATGC TGCCGTGTAT 

1418 CTGATGCTGG 1860 

E--> 1420 CCAACGATCT TATTCATGGG CTTTTCCCAG ATGCAATTAC CATTGGTGAA

1421 GATGTTAGCG 1920

E--> 1423 GAATGCCGAC ATTTTGTATT CCCGTTCAAG ATGGGGGTGT TGGCTTTGAC

1424 TATCGGCTGC 1980 

E--> 1426 ATATGGCAAT TGCTGATAAA TGGATTGAGT TGCTCAAGAA ACGGGATGAG 

1427 GATTGGAGAG 2040 

E--> 1429 TGGGTGATAT TGTTCATACA CTGACAAATA GAAGATGGTC GGAAAAGTGT 

1430 GTTTCATACG 2100 

E--> 1432 CTGAAAGTCA TGATCAAGCT CTAGTCGGTG ATAAAACTAT AGCATTCTGG

1433 CTGATGGACA 2160

E--> 1435 AGGATATGTA TGATTTTATG GCTTTGGATA GACCGTCAAC ATCATTAATA

1436 GATCGTGGGA 2220 

E--> 1438 TAGCATTGCA CAAGATGATT AGGCTTGTAA CTATGGGATT AGGAGGAGAA 

1439 GGGTACCTAA 2280 

E--> 1441 ATTTCATGGG AAATGAATTC GGCCACCCTG AGTGGATTGA TTTCCCTAGG 

1442 GCTGAACAAC 2340

E--> 1444 ACCTCTCTGA TGGCTCAGTA ATTCCCGGAA ACCAATTCAG TTATGATAAA

1445 TGCAGACGGA 2400

E--> 1447 GATTTGACCT GGGAGATGCA GAATATTTAA GATACCGTGG GTTGCAAGAA 

1448 TTTGACCGGG 2460 

E--> 1450 CTATGCAGTA TCTTGAAGAT AAATATGAGT TTATGACTTC AGAACACCAG 

1451 TTCATATCAC 2520 

E--> 1453 GAAAGGATGA AGGAGATAGG ATGATTGTAT TTGAAAAAGG AAACCTAGTT

1454 TTTGTCTTTA 2580 

E--> 1456 ATTTTCACTG GACAAAAAGC TATTCAGACT ATCGCATAGG CTGGCTGAAG

1457 CCTGGAAAAT 2640

E--> 1459 ACAAGGTTGC CTTGGACTCA GATGATCCAC TTTTTGGTGG

1460 CTTCGGGAGA ATTGATCATA 2700

E--> 1462 ATGCCGAATG TTTCACCTTT GAAGGATGGT ATGATGATCG TCCTCGTTCA 

1463 ATTATGGTGT 2760
E--> 1465 ATGCACCTAG TAGAACAGCA GTGGTCTATG CACTAGTAGA CAAAGAAGAA

1466 GAAGAAGAAG 2820
E--> 1468 AAGTAGCAGT AGTAGAAGAA GTAGTAGTAG AAGAAGAATG AACGAACTTG

1469 TGATCGCGTT 2880 
E--> 1471 GAAAGATTTG AACGCTACAT AGAGCTTCTT GACGTATCTG GCAATATTGC 

1472 ATCAGTCTTG 2940 
E--> 1474 GCGGAATTTC ATGTGACAAA AGGTTTGCAA TTCTTTCCAC TATTAGTAGT

1475 GCAACGATAT 3000
E--> 1477 ACGCAGAGAT GAAGTGCTGA ACAAACATAT GTAAAATCGA TGAATTTATG

1478 TCGAATGCTG 3060 
E--> 1480 GGACGGGCTT CAGCAGGTTT TGCTTAGTGA GTTCTGTAAA TTGTCATCTC 

1481 TTTANATGTA 3120 
E--> 1483 CAGCCCACTA GAAATCAATT ATGTGAGACC TAAAAAACAA TAACCATAAA 

1484 ATGGAAATAG 3180
E--> 1486 TGCTGATCTA ATGATGTTTT AANCCNNNNA AAAAAAAAAA AAAAACTCGA 



1487 G 3231 

1489 (2) INFORMATION FOR SEQ ID NO: 19: 

1491 (i) SEQUENCE CHARACTERISTICS: 

1492 (A) LENGTH: 2578 base pairs 

1493 (B) TYPE: nucleic acid 

1494 (C) STRANDEDNESS: single

1495 (D) TOPOLOGY: linear 

1501 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



E--> 1503 TCATTAAAGA GGAGAAATTA ACTATGAGAG GATCTCACCA TCACCATCAC 

1504 CATGGGATCT 60 
E--> 1506 TGGCTGAAAA GTCTTCTTAC AATTCCGAAT TCCGACCTTC TACAGTTGCA 

1507 GCATCGGGGA 120

E--> 1509 AAGTCCTTGT GCCTGGAACC CAGAGTGATA GCTCCTCATC CTCAACAAAC 

1510 CAATTTGAGT 180 

E--> 1512 TCACTGAGAC ATCTCCAGAA AATTCCCCAG CATCAACTGA TGTAGATAGT 

1513 TCAACAATGG 240

E--> 1515 AACACGCTAG CCAGATTAAA ACTGAGAACG ATGACGTTGA

1516 GCCGTCAAGT GATCTTACAG 300 

E--> 1518 GAAGTGTTGA AGAGCTGGAT TTTGCTTCAT CACTACAACT ACAAGAAGGT 

1519 GGTAAACTGG 360

E--> 1521 AGGAGTCTAA AACATTAAAT ACTTCTGAAG AGACAATTAT TGATGAATCT 

1522 GATAGGATCA 420 

E--> 1524 GAGAGAGGGG CATCCCTCCA CCTGGACTTG GTCAGAAGAT 

1525 TTATGAAATA GACCCCCTTT 480

E--> 1527 TGACAAACTA TCGTCAACAC CTTGATTACA GGTATTCACA GTACAAGAAA

1528 CTGAGGGAGG 540

E--> 1530 CAATTGACAA GTATGAGGGT GGTTTGGAAG CTTTTTCTCG TGGTTATGAA 

1531 AAAATGGGTT 600 

E--> 1533 TCACTCGTAG TGCTACAGGT ATCACTTACC GTGAGTGGGC 

1534 TCCTGGTGCC CAGTCAGCTG 660

E--> 1536 CCCTCATTGG AGATTTCAAC AATTGGGACG CAAATGCTGA CATTATGACT

1537 CGGAATGAAT 720
E--> 1539 TTGGTGTCTG GGAGATTTTT CTGCCAAATA ATGTGGATGG TTCTCCTGCA

1540 ATTCCTCATG 780
E--> 1542 GGTCCAGAGT GAAGATACGT ATGGACACTC CATCAGGTGT

1543 TAAGGATTCC ATTCCTGCTT 840
E--> 1545 GGATCAACTA CTCTTCACAG CTTCCTGATG AAATTCCATA TAATGGAATA

1546 TATTATGATC 900
E--> 1548 CACCCGAAGA GGAGAGGTAT ATCTTCCAAC ACCCACGGCC

1549 AAAGAAACCA AAGTCGCTGA 960
E--> 1551 GAATATATGA ATCTCATATT GGAATGAGTA GTCCGGAGCC TAAAATTAAC 

1552 TCATACGTGA 1020 
E--> 1554 ATTTTAGAGA TGAAGTTCTT CCTCGCATAA AAAAGCTTGG GTACAATGCG 

1555 GTGCAAATTA 1080 



E--> 1557 TGGCTATTCA AGAGCATTCT TATTATGCTA GTTTTGGTTA TCATGTCACA 

1558 AATTTTTTTG 1140 
E--> 1560 CACCAAGCAG CCGTTTTGGA ACGCCCGACG ACCTTAAGTC TTTGATTGAT 

1561 AAAGCTCATG 1200 




E--> 1563 AGCTAGGAAT TGTTGTTCTC ATGGACATTG TTCACAGCCA TGCATCAAAT 

1564 AATACTTTAG 1260 
E--> 1566 ATGGACTGAA CATGTTTGAC GGCACCGATA GTTGTTACTT TCACTCTGGA 

1567 GCTCGTGGTT 1320 
E--> 1569 ATCATTGGAT GTGGGATTCC CGCCTTTTTA ACTATGGAAA CTGGGAGGTA 

1570 CTTAGGTATC 1380 
E--> 1572 TTCTCTCAAA TGCGAGATGG TGGTTGGATG AGTTCAAATT TGATGGATTT 

1573 AGATTTGATG 1440 
E--> 1575 GTGTGACATC AATGATGTAT ACTCACCACG GATTATCGGT GGGATTCACT 

1576 GGGAACTACG 1500 
E--> 1578 AGGAATACTT TGGACTCGCA ACTGATGTGG ATGCTGTTGT GTATCTGATG 

1579 CTGGTCAACG 1560
E--> 1581 ATCTTATTCA TGGGCTTTTC CCAGATGCAA TTACCATTGG TGAAGATGTT 

1582 AGCGGAATGC 1620 
E--> 1584 CGACATTTTG TATTCCCGTT CAAGATGGGG GTGTTGGCTT TGACTATCGG 

1585 CTGCATATGG 1680 
E--> 1587 CAATTGCTGA TAAATGGATT GAGTTGCTCA AGAAACGGGA TGAGGATTGG 

1588 AGAGTGGGTG 1740 
E--> 1590 ATATTGTTCA TACACTGACA AATAGAAGAT GGTCGGAAAA GTGTGTTTCA 

1591 TACGCTGAAA 1800 
E--> 1593 GTCATGATCA AGCTCTAGTC GGTGATAAAA CTATAGCATT CTGGCTGATG 

1594 GACAAGGATA 1860 
E--> 1596 TGTATGATTT TATGGCTCTG GATAGACCGC CAACATCATT AATAGATCGT 

1597 GGGATAGCAT 1920
E--> 1599 TGCACAAGAT GATTAGGCTT GTAACTATGG GATTAGGAGG 

1600 AGAAGGGTAC CTAAATTTCA 1980 
E--> 1602 TGGGAAATGA ATTCGGCCAC CCTGAGTGGA TTGATTTCCC 

1603 TAGGGCTGAA CAACACCTCT 2040

E--> 1605 CTGATGACTC AGTAATTCCC GGAAACCAAT TCAGTTATGA TAAATGCAGA 

1606 CGGAGATTTG 2100 
E--> 1608 ACCTGGGAGA TGCAGAATAT TTAAGATACC GTGGGTTGCA AGAATTTGAC 

1609 CGGGCTATGC 2160
E--> 1611 AGTATCTTGA AGATAAATAT GAGTTTATGA CTTCAGAACA CCAGTTCATA

1612 TCACGAAAGG 2220

E--> 1614 ATGAAGGAGA TAGGATGATT GTATTTGAAA AAGGAAACCT AGTTTTTGTC 

1615 TTTAATTTTC 2280 

E--> 1617 ACTGGACAAA AAGCTATTCA GACTATCGCA TAGGCTGCCT 

1618 GAAGCCTGGA AAATACAAGG 2340

E--> 1620 TTGCCTTGGA CTCAGATGAT CCACTTTTTG GTGGCTTCGG GAGAATTGAT

1621 CATAATGCCG 2400

E--> 1623 AATATTTCAC CTTTGAAGGA TGGTATGATG ATCGTCCTCG TTCAATTATG

1624 GTGTATGCAC 2460 

E--> 1626 CTTGTAGAAC AGCAGTGGTC TATGCACTAG TAGACAAAGA AGAAGAAGAA 

1627 GAAGAAGAAG 2520 

E--> 1629 AAGAAGAAGT AGCAGTAGTA GAAGAAGTAG TAGTAGAAGA

1630 AGAATGAACG AACTTGTG 2578

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: [handwritten annotation illegible]
(A) NAME: National Starch and Chemical Investment Holding Corporation
(B) STREET: 501 Silverside Road, Suite 27
(C) CITY: Wilmington
(D) STATE: Delaware
(E) COUNTRY: United States of America
(F) POSTAL CODE (ZIP): 19809

(ii) TITLE OF INVENTION: Improvements in or Relating to Plant Starch Composition

(iii) NUMBER OF SEQUENCES: 20

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE:
(B) STREET:

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:

(2) INFORMATION FOR SEQ ID NO: 1:




VERIFICATION SUMMARY

PATENT APPLICATION: US/10/056,454

DATE: 02/15/2002
TIME: 14:39:25

Input Set : A:\EP.txt
Output Set: N:\CRF3\02152002\J056454.raw

L:13 M:220 C: Keyword misspelled or invalid format, [(F) ZIP:]
L:20 M:220 C: Keyword misspelled or invalid format, [(v) COMPUTER READABLE FORM:
L:6 M:200 E: Mandatory Header Field missing, [(i) APPLICANT:] of (1) Value not provided
L:0 M:200 E: Mandatory Header Field missing, [(A) ADDRESSEE:] of (1)(iv)
L:0 M:248 E: Inserted missing Mandatory Header Field, [(iv) CORRESPONDENCE ADDRESS:]
L:0 M:247 C: Inserted Optional Header Field, [(viii) ATTORNEY/AGENT INFORMATION:]
L:0 M:249 C: Inserted Mandatory Field, [(vi) CURRENT APPLICATION DATA:]
L:0 M:249 C: Inserted Mandatory Field, [(A) APPLICATION NUMBER:]
L:0 M:249 C: Inserted Mandatory Field, [(B) FILING DATE:]
L:41 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:1
L:218 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:12
M:254 Repeated in SeqNo=12
L:384 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:13
M:254 Repeated in SeqNo=13
L:532 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:13
L:551 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:14
M:254 Repeated in SeqNo=14
L:833 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:14
L:835 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:14
L:847 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:14
L:1042 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:16
M:254 Repeated in SeqNo=16
L:1185 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:17
M:254 Repeated in SeqNo=17
L:1327 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:18
M:254 Repeated in SeqNo=18
L:1503 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:19
M:254 Repeated in SeqNo=19